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10 I HYPOTHESIS 
TESTING WITH TWO 
SAMPLES 




Figure 10.1 If you want to test a claim that involves two groups (the types of breakfasts eaten east and west of the 
Mississippi River) you can use a slightly different technique when conducting a hypothesis test, (credit: Chloe Lim) 



Introduction 



Chapter Objectives 



By the end of this chapter, the student should be able to: 

• Classify hypothesis tests by type. 

• Conduct and interpret hypothesis tests for two population means, population standard deviations known. 

• Conduct and interpret hypothesis tests for two population means, population standard deviations unknown. 

• Conduct and interpret hypothesis tests for two population proportions. 

• Conduct and interpret hypothesis tests for matched or paired samples. 



Studies often compare two groups. For example, researchers are interested in the effect aspirin has in preventing heart 
attacks. Over the last few years, newspapers and magazines have reported various aspirin studies involving two groups. 
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Typically, one group is given aspirin and the other group is given a placebo. Then, the heart attack rate is studied over 
several years. 

There are other situations that deal with the comparison of two groups. For example, studies compare various diet and 
exercise programs. Politicians compare the proportion of individuals from different income brackets who might vote for 
them. Students are interested in whether SAT or GRE preparatory courses really help raise their scores. 

You have learned to conduct hypothesis tests on single means and single proportions. You will expand upon that in this 
chapter. You will compare two means or two proportions to each other. The general procedure is still the same, just 
expanded. 

To compare two means or two proportions, you work with two groups. The groups are classified either as independent or 
matched pairs. Independent groups consist of two samples that are independent, that is, sample values selected from one 
population are not related in any way to sample values selected from the other population. Matched pairs consist of two 
samples that are dependent. The parameter tested using matched pairs is the population mean. The parameters tested using 
independent groups are either population means or population proportions. 



NOTE 



This chapter relies on either a calculator or a computer to calculate the degrees of freedom, the test statistics, and 
p-values. TI-83+ and TI-84 instructions are included as well as the test statistic formulas. When using a TI-83+ or 
TI-84 calculator, we do not need to separate two population means, independent groups, or population variances 
unknown into large and small sample sizes. However, most statistical computer software has the ability to differentiate 
these tests. 



This chapter deals with the following hypothesis tests: 

Independent groups (samples are independent) 

• Test of two population means. 

• Test of two population proportions. 

Matched or paired samples (samples are dependent) 

• Test of the two population proportions by testing one population mean of differences. 

10.1 1 Two Population Means with Unknown Standard 
Deviations 

1. The two independent samples are simple random samples from two distinct populations. 

2. For the two distinct populations: 

° if the sample sizes are small, the distributions are important (should be normal) 

° if the sample sizes are large, the distributions are not important (need not be normal) 



The test comparing two independent population means with unknown and possibly unequal population standard 
deviations is called the Aspin- Welch t-test. The degrees of freedom formula was developed by Aspin- Welch. 



The comparison of two population means is very common. A difference between the two samples depends on both the 
means and the standard deviations. Very different means can occur by chance if there is great variation among the individual 

samples. In order to account for the variation, we take the difference of the sample means, X] - X 2 , and divide by the 

standard error in order to standardize the difference. The result is a t-score test statistic. 

Because we do not know the population standard deviations, we estimate them using the two sample standard deviations 
from our independent samples. For the hypothesis test, we calculate the estimated standard deviation, or standard error, 

of the difference in sample means, Xj - X 2 . 
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The standard error is: 



(*l ) 2 . (s 2 ) 

n . * n _ 



The test statistic (t-score) is calculated as follows: 



Ul~ x 2 )-Qu 1 -/i 2 ) 

\ j( s l ) 2 . ( s 2 ) 2 

y yij ' «2 



where: 

• si and S 2 , the sample standard deviations, are estimates of cri and 02 , respectively. 

• 01 and 01 are the unknown population standard deviations. 

• x 1 and x 2 are the sample means, pi and p 2 are the population means. 



The number of degrees of freedom ( df) requires a somewhat complicated calculation. However, a computer or calculator 
calculates it easily. The df are not always a whole number. The test statistic calculated previously is approximated by the 
Student's t-distribution with df as follows: 



Degrees of freedom 




When both sample sizes ni and 02 are five or larger, the Student's t approximation is very good. Notice that the sample 

2 2 

variances (si) and (S 2 ) are not pooled. (If the question comes up, do not pool the variances.) 



It is not necessary to compute this by hand. A calculator or computer easily computes it. 



Example 10.1 Independent groups 



The average amount of time boys and girls aged seven to 11 spend playing sports each day is believed to be the 
same. A study is done and data are collected, resulting in the data in Table 10.1. Each populations has a normal 
distribution. 





Sample 

Size 


Average Number of Hours Playing Sports Per 
Day 


Sample Standard 
Deviation 


Girls 


9 


2 


0.866 


Boys 


16 


3.2 


1.00 



Table 10.1 



Is there a difference in the mean amount of time boys and girls aged seven to 11 play sports each day? Test at the 
5% level of significance. 

Solution 10.1 

The population standard deviations are not known. Let g be the subscript for girls and b be the subscript for 
boys. Then, g g is the population mean for girls and gb is the population mean for boys. This is a test of two 

independent groups, two population means. 

Random variable: X g — X b = difference in the sample mean amount of time girls and boys play sports each 
day. 
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Ho:pg = pb Ho: iJg - iJb = o 
H a : pg * nb H a : Vg-Hb* 0 

The words "the same" tell you Ho has an Since there are no other words to indicate H a , assume it says "is 
different." This is a two-tailed test. 

Distribution for the test: Use tdf where df is calculated using the df formula for independent groups, two 
population means. Using a calculator, df is approximately 18.8462. Do not pool the variances. 

Calculate the p-value using a Student's t-distribution: p-value = 0.0054 
Graph: 




Figure 10.2 

s g = 0.866 
s h = 1 

So, x g - Jtj = 2- 3.2 = —1.2 

Half the p-value is below -1.2 and half is above 1.2. 

Make a decision: Since a > p-value, reject Ho. This means you reject p g = pb. The means are different. 

Q Using the TI-83, 83 + , 84, 84+ Calculator 

Press STAT. Arrow over to TESTS and press 4: 2-SampTTest. Arrow over to Stats and press ENTER. 
Arrow down and enter 2 for the first sample mean, V0.866 for Sxl, 9 for nl, 3 . 2 for the second sample 
mean, 1 for Sx2, and 16 for n2. Arrow down to pi: and arrow to does not equal p2. Press ENTER. 
Arrow down to Pooled: and No. Press ENTER. Arrow down to Calculate and press ENTER. The p-value 
is p = 0.0054, the dfs are approximately 18.8462, and the test statistic is -3.14. Do the procedure again but 
instead of Calculate do Draw. 



Conclusion: At the 5% level of significance, the sample data show there is sufficient evidence to conclude that 
the mean number of hours that girls and boys aged seven to 11 play sports per day is different (mean number of 
hours boys aged seven to 11 play sports per day is greater than the mean number of hours played by girls OR the 
mean number of hours girls aged seven to 11 play sports per day is greater than the mean number of hours played 
by boys). 



Try It 



10.1 Two samples are shown in Table 10.2. Both have normal distributions. The means for the two populations are 
thought to be the same. Is there a difference in the means? Test at the 5% level of significance. 
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Sample Size 


Sample Mean 


Sample Standard Deviation 


Population A 


25 


5 


1 


Population B 


16 


4.7 


1.2 



Table 10.2 



NOTE 

When the sum of the sample sizes is larger than 30 (ni + n 2 > 30) you can use the normal distribution to approximate 
the Student's t. 



Example 10.2 



A study is done by a community group in two neighboring colleges to determine which one graduates students 
with more math classes. College A samples 11 graduates. Their average is four math classes with a standard 
deviation of 1.5 math classes. College B samples nine graduates. Their average is 3.5 math classes with a standard 
deviation of one math class. The community group believes that a student who graduates from college A has 
taken more math classes, on the average. Both populations have a normal distribution. Test at a 1% significance 
level. Answer the following questions. 

a. Is this a test of two means or two proportions? 

Solution 10.2 

a. two means 

b. Are the populations standard deviations known or unknown? 

Solution 10.2 

b. unknown 

c. Which distribution do you use to perform the test? 

Solution 10.2 

c. Student's t 

d. What is the random variable? 

Solution 10.2 

d. X A -X B 

e. What are the null and alternate hypotheses? Write the null and alternate hypotheses in words and in symbols. 

Solution 10.2 

e. 



H 0 : fi A < Vb 
H a -VA> Vb 
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f. Is this test right-, left-, or two-tailed? 

Solution 10.2 

f. 




Figure 10.3 



right 



g. What is the p-value? 

Solution 10.2 

g. 0.1928 

h. Do you reject or not reject the null hypothesis? 

Solution 10.2 

h. Do not reject. 



i. Conclusion: 

Solution 10.2 

i. At the 1% level of significance, from the sample data, there is not sufficient evidence to conclude that a student 
who graduates from college A has taken more math classes, on the average, than a student who graduates from 
college B. 



10.2 A study is done to determine if Company A retains its workers longer than Company B. Company A samples 15 
workers, and their average time with the company is five years with a standard deviation of 1.2. Company B samples 
20 workers, and their average time with the company is 4.5 years with a standard deviation of 0.8. The populations are 
normally distributed. 

a. Are the population standard deviations known? 

b. Conduct an appropriate hypothesis test. At the 5% significance level, what is your conclusion? 



Example 10.3 



A professor at a large community college wanted to determine whether there is a difference in the means of final 
exam scores between students who took his statistics course online and the students who took his face-to-face 
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statistics class. He believed that the mean of the final exam scores for the online class would be lower than that 
of the face-to-face class. Was the professor correct? The randomly selected 30 final exam scores from each group 
are listed in Table 10.3 and Table 10.4. 



67.6 


41.2 


85.3 


55.9 


82.4 


91.2 


73.5 


94.1 


64.7 


64.7 


70.6 


38.2 


61.8 


88.2 


70.6 


58.8 


91.2 


73.5 


82.4 


35.5 


94.1 


88.2 


64.7 


55.9 


88.2 


97.1 


85.3 


61.8 


79.4 


79.4 



Table 10.3 Online Class 



77.9 


95.3 


81.2 


74.1 


98.8 


88.2 


85.9 


92.9 


87.1 


88.2 


69.4 


57.6 


69.4 


67.1 


97.6 


85.9 


88.2 


91.8 


78.8 


71.8 


98.8 


61.2 


92.9 


90.6 


97.6 


100 


95.3 


83.5 


92.9 


89.4 



Table 10.4 Face-to-face Class 



Is the mean of the Final Exam scores of the online class lower than the mean of the Final Exam scores of the 
face-to-face class? Test at a 5% significance level. Answer the following questions: 

a. Is this a test of two means or two proportions? 

b. Are the population standard deviations known or unknown? 

c. Which distribution do you use to perform the test? 

d. What is the random variable? 

e. What are the null and alternative hypotheses? Write the null and alternative hypotheses in words and in 
symbols. 

f. Is this test right, left, or two tailed? 

g. What is the p-value? 

h. Do you reject or not reject the null hypothesis? 

i. At the level of significance, from the sample data, there (is/is not) sufficient evidence to 

conclude that . 

(See the conclusion in Example 10.2, and write yours in a similar fashion) 

Using the TI-83, 83 + , 84, 84+ Calculator 



First put the data for each group into two lists (such as LI and L2). Press STAT. Arrow over to TESTS and 
press 4:2SampTTest. Make sure Data is highlighted and press ENTER. Arrow down and enter LI for the 
first list and L2 for the second list. Arrow down to pi: and arrow to / p2 (does not equal). Press ENTER. 
Arrow down to Pooled: No. Press ENTER. Arrow down to Calculate and press ENTER. 



NOTE 

Be careful not to mix up the information for Group 1 and Group 2! 



Solution 10.3 

a. two means 

b. unknown 

c. Student's t 
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d. X l -X 2 

e. 1. Ho : pi = p2 Null hypothesis: the means of the final exam scores are equal for the online and face-to- 

face statistics classes. 

2. H a : pi < p2 Alternative hypothesis: the mean of the final exam scores of the online class is less than 
the mean of the final exam scores of the face-to-face class. 

f. left-tailed 

g. p-value = 0.0011 



p-value = 



0 

Figure 10.4 

Reject the null hypothesis 

The professor was correct. The evidence shows that the mean of the final exam scores for the online class is 
lower than that of the face-to-face class. 

At the 5% level of significance, from the sample data, there is (is/is not) sufficient evidence to conclude that 
the mean of the final exam scores for the online class is less than the mean of final exam scores of the face- 
to-face class. 



Cohen's Standards for Small, Medium, and Large Effect Sizes 

Cohen's d is a measure of effect size based on the differences between two means. Cohen’s d, named for United States 
statistician Jacob Cohen, measures the relative strength of the differences between the means of two populations based on 
sample data. The calculated value of effect size is then compared to Cohen’s standards of small, medium, and large effect 
sizes. 



h. 

i. 




Size of effect 


d 


Small 


0.2 


medium 


0.5 


Large 


0.8 



Table 10.5 Cohen's 
Standard Effect 
Sizes 



Cohen's d is the measure of the difference between two means divided by the pooled standard deviation: 



d = 



X J - X 2 
s pooled 



where s pooled = 



in i - !)$]+ {n 2 - 1> 

n i + «2 - 2 



2 

2 
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Example 10.4 



Calculate Cohen’s d for Example 10.2. Is the size of the effect small, medium, or large? Explain what the size 
of the effect means for this problem. 

Solution 10.4 

pi = 4 si = 1.5 r?i = 11 
p 2 = 3.5 S2 = 1 H2 = 9 
d = 0.384 

The effect is small because 0.384 is between Cohen’s value of 0.2 for small effect size and 0.5 for medium effect 
size. The size of the differences of the means for the two colleges is small indicating that there is not a significant 
difference between them. 



Example 10.5 



Calculate Cohen’s d for Example 10.3. Is the size of the effect small, medium or large? Explain what the size 
of the effect means for this problem. 

Solution 10.5 

d = 0.834; Large, because 0.834 is greater than Cohen’s 0.8 for a large effect size. The size of the differences 
between the means of the Final Exam scores of online students and students in a face-to-face class is large 
indicating a significant difference. 



Try It 



10.5 Weighted alpha is a measure of risk-adjusted performance of stocks over a period of a year. A high positive 
weighted alpha signifies a stock whose price has risen while a small positive weighted alpha indicates an unchanged 
stock price during the time period. Weighted alpha is used to identify companies with strong upward or downward 
trends. The weighted alpha for the top 30 stocks of banks in the northeast and in the west as identified by Nasdaq on 
May 24, 2013 are listed in Table 10.6 and Table 10.7, respectively. 



94.2 


75.2 


69.6 


52.0 


48.0 


41.9 


36.4 


33.4 


31.5 


27.6 


77.3 


71.9 


67.5 


50.6 


46.2 


38.4 


35.2 


33.0 


28.7 


26.5 


76.3 


71.7 


56.3 


48.7 


43.2 


37.6 


33.7 


31.8 


28.5 


26.0 



Table 10.6 Northeast 



126.0 


70.6 


65.2 


51.4 


45.5 


37.0 


33.0 


29.6 


23.7 


22.6 


116.1 


70.6 


58.2 


51.2 


43.2 


36.0 


31.4 


28.7 


23.5 


21.6 


78.2 


68.2 


55.6 


50.3 


39.0 


34.1 


31.0 


25.3 


23.4 


21.5 



Table 10.7 West 



Is there a difference in the weighted alpha of the top 30 stocks of banks in the northeast and in the west? Test at a 5% 
significance level. Answer the following questions: 

a. Is this a test of two means or two proportions? 

b. Are the population standard deviations known or unknown? 

c. Which distribution do you use to perform the test? 

d . What is the random variable? 

e. What are the null and alternative hypotheses? Write the null and alternative hypotheses in words and in symbols. 
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f. Is this test right, left, or two tailed? 

g . What is the p- value? 

h. Do you reject or not reject the null hypothesis? 

i. At the level of significance, from the sample data, there (is/is not) sufficient evidence to conclude 

that . 

j. Calculate Cohen’s d and interpret it. 



10.2 | Two Population Means with Known Standard 
Deviations 



Even though this situation is not likely (knowing the population standard deviations is not likely), the following example 
illustrates hypothesis testing for independent means, known population standard deviations. The sampling distribution for 

the difference between the means is normal and both populations must be normal. The random variable is X^-X 2 . The 

normal distribution has the following format: 



Normal distribution is: 

x t - x 2 ~n 



AD - AD, 



(fft r ^ (*2) 



'i 



+ - 



n 2 



The standard deviation is: 



(<D) ^ (*2) 



+ ’ 

,V l 

The test statistic (z-score) is: 

7 = ( X t — X 2 ) — (p 1 - p 2 ) 

Dp 2 l DP 2 



Example 10.6 



Independent groups, population standard deviations known: The mean lasting time of two competing floor 
waxes is to be compared. Twenty floors are randomly assigned to test each wax. Both populations have a normal 
distributions. The data are recorded in Table 10.8. 



Wax 


Sample Mean Number of Months Floor Wax Lasts 


Population Standard Deviation 


1 


3 


0.33 


2 


2.9 


0.36 



Table 10.8 



Does the data indicate that wax 1 is more effective than wax 2? Test at a 5% level of significance. 

Solution 10.6 

This is a test of two independent groups, two population means, population standard deviations known. 

Random Variable: X j - X 2 = difference in the mean number of months the competing floor waxes last. 

Ho: pi < p 2 
H a : pi > p2 
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The words "is more effective" says that wax 1 lasts longer than wax 2, on average. "Longer" is a “>” symbol 
and goes into H a . Therefore, this is a right-tailed test. 



Distribution for the test: The population standard deviations are known so the distribution is normal. Using the 
formula, the distribution is: 







Since pt < p 2 then pt - p 2 < 0 and the mean for the normal distribution is zero. 

Calculate the p-value using the normal distribution: p-value = 0.1799 
Graph: 




0 0.1 

From H 0 : fi t - ^ 2 < 0 



Figure 10.5 

X 1 - X 2 =3-2.9 = 0.1 

Compare or and the p-value: a = 0.05 and p-value = 0.1799. Therefore, a < p-value. 

Make a decision: Since a < p-value, do not reject Ho. 

Conclusion: At the 5% level of significance, from the sample data, there is not sufficient evidence to conclude 
that the mean time wax 1 lasts is longer (wax 1 is more effective) than the mean time wax 2 lasts. 

Using the TI-83, 83 + , 84, 84+ Calculator 



Press STAT. Arrow over to TESTS and press 3:2-SampZTest. Arrow over to Stats and press ENTER. 
Arrow down and enter . 33 for sigmal, . 36 for sigma2, 3 for the first sample mean, 20 for nl, 2 . 9 for the 
second sample mean, and 20 for n2. Arrow down to pi: and arrow to > p 2 . Press ENTER. Arrow down to 
Calculate and press ENTER. The p-value is p = 0.1799 and the test statistic is 0.9157. Do the procedure 
again, but instead of Calculate do D raw. 



IhMIM 



♦ 



10.6 The means of the number of revolutions per minute of two competing engines are to be compared. Thirty engines 
are randomly assigned to be tested. Both populations have normal distributions. Table 10.9 shows the result. Do the 
data indicate that Engine 2 has higher RPM than Engine 1? Test at a 5% level of significance. 
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Engine 


Sample Mean Number of RPM 


Population Standard Deviation 


1 


1,500 


50 


2 


1,600 


60 



Table 10.9 



Example 10.7 



An interested citizen wanted to know if Democratic U. S. senators are older than Republican U.S. senators, on 
average. On May 26 2013, the mean age of 30 randomly selected Republican Senators was 61 years 247 days 
old (61.675 years) with a standard deviation of 10.17 years. The mean age of 30 randomly selected Democratic 
senators was 61 years 257 days old (61.704 years) with a standard deviation of 9.55 years. 

Do the data indicate that Democratic senators are older than Republican senators, on average? Test at a 5% level 
of significance. 

Solution 10.7 

This is a test of two independent groups, two population means. The population standard deviations are unknown, 
but the sum of the sample sizes is 30 + 30 = 60, which is greater than 30, so we can use the normal approximation 
to the Student’s-t distribution. Subscripts: 1: Democratic senators 2: Republican senators 

Random variable: X] - X 2 = difference in the mean age of Democratic and Republican U.S. senators. 

Ho: m < p 2 Ho: pi - ji 2 ^ 0 
H a : Jii > 112 H a : pi - p2 > 0 

The words "older than" translates as a “>” symbol and goes into H a - Therefore, this is a right-tailed test. 
Distribution for the test: The distribution is the normal approximation to the Student’s t for means, independent 

groups. Using the formula, the distribution is: X j - X 2 ~ N[ 0, — T ] 

Since pi < p 2 , pi - p 2 ^ 0 and the mean for the normal distribution is zero. 

(Calculating the p-value using the normal distribution gives p-value = 0.4040) 

Graph: 



Figure 10.6 




Compare or and the p-value: a = 0.05 and p-value = 0.4040. Therefore, a < p-value. 

Make a decision: Since a < p-value, do not reject Ho- 

Conclusion: At the 5% level of significance, from the sample data, there is not sufficient evidence to conclude 
that the mean age of Democratic senators is greater than the mean age of the Republican senators. 
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10.3 | Comparing Two Independent Population 
Proportions 

When conducting a hypothesis test that compares two independent population proportions, the following characteristics 
should be present: 

1. The two independent samples are simple random samples that are independent. 

2. The number of successes is at least five, and the number of failures is at least five, for each of the samples. 

3. Growing literature states that the population must be at least ten or 20 times the size of the sample. This keeps each 
population from being over-sampled and causing incorrect results. 

Comparing two proportions, like comparing two means, is common. If two estimated proportions are different, it may be 
due to a difference in the populations or it may be due to chance. A hypothesis test can help determine if a difference in the 
estimated proportions reflects a difference in the population proportions. 

The difference of two proportions follows an approximate normal distribution. Generally, the null hypothesis states that the 
two proportions are the same. That is. Ho: pa = PB ■ To conduct the test, we use a pooled proportion, p c . 

The pooled proportion is calculated as follows: 

X A+ X B 
PC n A + n B 

The distribution for the differences is: 

P'a ~ P's~N[ 0, \j Pc ( 1 - Pc)(^ + ±)1 

The test statistic (z-score) is: 

„ = (p'a - P' b> ~ ( Pa - Pb ) 

fPcO - Pc)(wx + 7^) 



Example 10.8 



Two types of medication for hives are being tested to determine if there is a difference in the proportions of 
adult patient reactions. Twenty out of a random sample of 200 adults given medication A still had hives 30 
minutes after taking the medication. Twelve out of another random sample of 200 adults given medication B 
still had hives 30 minutes after taking the medication. Test at a 1% level of significance. 



Solution 10.8 

The problem asks for a difference in proportions, making it a test of two proportions. 

Let A and B be the subscripts for medication A and medication B, respectively. Then pa and pe are the desired 
population proportions. 

Random Variable: 

P'a — P'b = difference in the proportions of adult patients who did not react after 30 minutes to medication A and 
to medication B. 

Ho: PA = PB 



PA-pB = 0 
H a : PA * PB 
PA-PB * 0 

The words "is a difference" tell you the test is two-tailed. 

Distribution for the test: Since this is a test of two binomial population proportions, the distribution is normal: 

_ x a + X B _ 20 + 12 _oo8 1 - n — 0 92 
Pc n A + n B 200 + 200 Pc 



P\ 



J '«~A/[o, ^(0.08)(0.92)(^ 



H — ) 

200 200 . 
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P'A -P'b follows an approximate normal distribution. 

Calculate the p-value using the normal distribution: p-value = 0.1404. 

Estimated proportion for group A: p' A — ^ = 0.1 

Estimated proportion for group B: p' B = 0.06 

Graph: 




Figure 10.7 



P'A -P'B = 0.1 -0.06 = 0.04. 

Half the p-value is below -0.04, and half is above 0.04. 

Compare a and the p-value: a = 0.01 and the p-value = 0.1404. a < p-value. 

Make a decision: Since a < p-value, do not reject Ho. 

Conclusion: At a 1% level of significance, from the sample data, there is not sufficient evidence to conclude that 
there is a difference in the proportions of adult patients who did not react after 30 minutes to medication A and 
medication B. 

Using the T!-83, 83 + , 8H, 8S+ Calculator 



Press STAT. Arrow over to TESTS and press 6 : 2 - PropZTest. Arrow down and enter 20 for xl, 200 
fornl, 12 for x2, and 200 for n2. Arrow down to pi: and arrow to not equal p2. Press ENTER. Arrow 
down to Calculate and press ENTER. The p-value is p = 0.1404 and the test statistic is 1.47. Do the 
procedure again, but instead of Calculate do Draw. 



Try It 



10.8 Two types of valves are being tested to determine if there is a difference in pressure tolerances. Fifteen out of 
a random sample of 100 of Valve A cracked under 4,500 psi. Six out of a random sample of 100 of Valve B cracked 
under 4,500 psi. Test at a 5% level of significance. 



Example 10.9 



A research study was conducted about gender differences in “sexting.” The researcher believed that the proportion 
of girls involved in “sexting” is less than the proportion of boys involved. The data collected in the spring of 2010 
among a random sample of middle and high school students in a large school district in the southern United States 
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is summarized in Table 10.9. Is the proportion of girls sending sexts less than the proportion of boys “sexting?” 
Test at a 1% level of significance. 





Males 


Females 


Sent “sexts” 


183 


156 


Total number surveyed 


2231 


2169 



Table 10.10 



Solution 10.9 

This is a test of two population proportions. Let M and F be the subscripts for males and females. Then pm and 
PF are the desired population proportions. 

Random variable: 

p'F ~ p'M = difference in the proportions of males and females who sent “sexts.” 

Ho: PF = PM Ho: PF~PM = 0 
H a : PF < PM H a : PF~PM< 0 
The words "less than" tell you the test is left-tailed. 

Distribution for the test: Since this is a test of two population proportions, the distribution is normal: 



= x F + x M = 156+183 

Ic n F + n M 2169 + 2231 

1 - p c = 0.923 



0.077 



Therefore, 

P'f - P m ~ Jv(o, ^(0.077)(0.923)(2 | L+2^ t )) 
P'f - p'M follows an approximate normal distribution. 



Calculate the p-value using the normal distribution: 

p-value = 0.1045 

Estimated proportion for females: 0.0719 
Estimated proportion for males: 0.082 

Graph: 



Figure 10.8 




Decision: Since a < p-value, Do not reject Ho 

Conclusion: At the 1% level of significance, from the sample data, there is not sufficient evidence to conclude 
that the proportion of girls sending “sexts” is less than the proportion of boys sending “sexts.” 
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Using the TI-83, 83 + , 84, 84+ Calculator 



Press STAT. Arrow over to TESTS and press 6:2-PropZTest. Arrow down and enter 156 for xl, 2169 for nl, 
183 for x2, and 2231 for n2. Arrow down to pi: and arrow to less than p2. Press ENTER. Arrow down to 
Calculate and press ENTER. The p-value is P = 0.1045 and the test statistic is z = -1.256. 



Example 10.10 



Researchers conducted a study of smartphone use among adults. A cell phone company claimed that iPhone 
smartphones are more popular with whites (non-Hispanic) than with African Americans. The results of the survey 
indicate that of the 232 African American cell phone owners randomly sampled, 5% have an iPhone. Of the 1,343 
white cell phone owners randomly sampled, 10% own an iPhone. Test at the 5% level of significance. Is the 
proportion of white iPhone owners greater than the proportion of African American iPhone owners? 



Solution 10.10 

This is a test of two population proportions. Let W and A be the subscripts for the whites and African Americans. 
Then pwand pa are the desired population proportions. 

Random variable: 

p'w —p' A = difference in the proportions of Android and iPhone users. 

Ho:pw = PA Ho:pw~PA = 0 



Ha', pw > PA H a : PW~PA> o 

The words "more popular" indicate that the test is right-tailed. 
Distribution for the test: The distribution is approximately normal: 
X W + X A_ 134+12 



yL n w + n A 1343 + 232 
1 - Pc = 0.9073 



0.0927 



Therefore, 

P'w-P' A " *(o, f(0.0927)(0.9073)( T 3L + J 2 )) 



Pw~ P a follows an approximate normal distribution. 

Calculate the p-value using the normal distribution: 



p-value = 0.0077 

Estimated proportion for group A: 0.10 
Estimated proportion for group B: 0.05 

Graph: 
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Figure 10.9 

Decision: Since a > p-value, reject the Ho. 

Conclusion: At the 5% level of significance, from the sample data, there is sufficient evidence to conclude that a 
larger proportion of white cell phone owners use iPhones than African Americans. 

Using the TI-83, 83 + , 8S, 89 + Calculator 



TI-83+ and TI-84: Press STAT. Arrow over to TESTS and press 6:2-PropZTest. Arrow down and enter 135 
for xl, 1343 for nl, 12 for x2, and 232 for n2. Arrow down to pi: and arrow to greater than p2. Press ENTER. 
Arrow down to Calculate and press ENTER. The P-value is P = 0.0092 and the test statistic is Z = 2.33. 



Try It 



10.10 A concerned group of citizens wanted to know if the proportion of forcible rapes in Texas was different in 2011 
than in 2010. Their research showed that of the 113,231 violent crimes in Texas in 2010, 7,622 of them were forcible 
rapes. In 2011, 7,439 of the 104,873 violent crimes were in the forcible rape category. Test at a 5% significance level. 
Answer the following questions: 

a. Is this a test of two means or two proportions? 

b. Which distribution do you use to perform the test? 

c. What is the random variable? 

d. What are the null and alternative hypothesis? Write the null and alternative hypothesis in symbols. 

e. Is this test right-, left-, or two-tailed? 

f. What is the p-value? 

g. Do you reject or not reject the null hypothesis? 

h. At the level of significance, from the sample data, there (is/is not) sufficient evidence to conclude that 



10.4 | Matched or Paired Samples 

When using a hypothesis test for matched or paired samples, the following characteristics should be present: 

1. Simple random sampling is used. 

2. Sample sizes are often small. 

3. Two measurements (samples) are drawn from the same pair of individuals or objects. 
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4. Differences are calculated from the matched or paired samples. 

5. The differences form the sample that is used for the hypothesis test. 

6. Either the matched pairs have differences that come from a population that is normal or the number of differences is 
sufficiently large so that distribution of the sample mean of differences is approximately normal. 

In a hypothesis test for matched or paired samples, subjects are matched in pairs and differences are calculated. The 
differences are the data. The population mean for the differences, p<j, is then tested using a Student's-t test for a single 
population mean with n— 1 degrees of freedom, where n is the number of differences. 

The test statistic (f-score) is: 

, x d~ Vd 

ten 

V Vn ) 



Example 10.11 



A study was conducted to investigate the effectiveness of hypnotism in reducing pain. Results for randomly 
selected subjects are shown in Table 10.10. A lower score indicates less pain. The "before" value is matched to 
an "after" value and the differences are calculated. The differences have a normal distribution. Are the sensory 
measurements, on average, lower after hypnotism? Test at a 5% significance level. 



Subject: 


A 


B 


C 


D 


E 


F 


G 


H 


Before 


6.6 


6.5 


9.0 


10.3 


11.3 


8.1 


6.3 


11.6 


After 


6.8 


2.4 


7.4 


8.5 


8.1 


6.1 


3.4 


2.0 



Table 10.11 



Solution 10.11 

Corresponding "before" and "after" values form matched pairs. (Calculate "after" - "before.") 



After Data 


Before Data 


Difference 


6.8 


6.6 


0.2 


2.4 


6.5 


-4.1 


7.4 


9 


-1.6 


8.5 


10.3 


-1.8 


8.1 


11.3 


-3.2 


6.1 


8.1 


-2 


3.4 


6.3 


-2.9 


2 


11.6 


-9.6 



Table 10.12 



The data for the test are the differences: {0.2, -4.1, -1.6, -1.8, -3.2, -2, -2.9, -9.6} 

The sample mean and sample standard deviation of the differences are: x~ d = -3.13 and s d — 2.91 Verify these 
values. 

Let /J d be the population mean for the differences. We use the subscript d to denote "differences." 

Random variable: X d = the mean difference of the sensory measurements 
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Ho: pd > 0 

The null hypothesis is zero or positive, meaning that there is the same or more pain felt after hypnotism. That 
means the subject shows no improvement, pd is the population mean of the differences.) 

H a : pd < 0 

The alternative hypothesis is negative, meaning there is less pain felt after hypnotism. That means the subject 
shows improvement. The score should be lower after hypnotism, so the difference ought to be negative to indicate 
improvement. 

Distribution for the test: The distribution is a Student's t with df=n - 1 = 8-1 = 7. Use tv. (Notice that the 
test is for a single population mean.) 

Calculate the p-value using the Student's-t distribution: p-value = 0.0095 
Graph: 



*d 




From H a : fi d >0 



Figure 10.10 

X d is the random variable for the differences. 

The sample mean and sample standard deviation of the differences are: 
x d =-3.13 

5 d = 2.91 

Compare or and the p-value: a = 0.05 and p-value = 0.0095. a > p-value. 

Make a decision: Since a > p-value, reject Ho. This means that pd < 0 and there is improvement. 

Conclusion: At a 5% level of significance, from the sample data, there is sufficient evidence to conclude that 
the sensory measurements, on average, are lower after hypnotism. Hypnotism appears to be effective in reducing 
pain. 



NOTE 

£ !p For the T1-83+ and Tl-84 calculators, you can either calculate the differences ahead of time (after - 
before) and put the differences into a list or you can put the after data into a first list and the before 
data into a second list. Then go to a third list and arrow up to the name. Enter 1 st list name - 2 nd list name. The 
calculator will do the subtraction, and you will have the differences in the third list. 




Jsinq the 



TI-83, 83 + , 89, 88+ Calculator 



Use your list of differences as the data. Press STAT and arrow over to TESTS. Press 2 : T-Test. Arrow 
over to Data and press ENTER. Arrow down and enter 0 for p 0 , the name of the list where you put the data, 
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and 1 for Freq:. Arrow down to |i: and arrow over to < /j 0 . Press ENTER. Arrow down to Calculate and 

press ENTER. The p-value is 0.0094, and the test statistic is -3.04. Do these instructions again except, arrow 
to Draw (instead of Calculate). Press ENTER. 




10.11 A study was conducted to investigate how effective a new diet was in lowering cholesterol. Results for the 
randomly selected subjects are shown in the table. The differences have a normal distribution. Are the subjects’ 
cholesterol levels lower on average after the diet? Test at the 5% level. 



Subject 


A 


B 


C 


D 


E 


F 


G 


H 


1 


Before 


209 


210 


205 


198 


216 


217 


238 


240 


222 


After 


199 


207 


189 


209 


217 


202 


211 


223 


201 



Table 10.13 



Example 10.12 



A college football coach was interested in whether the college's strength development class increased his players' 
maximum lift (in pounds) on the bench press exercise. He asked four of his players to participate in a study. 
The amount of weight they could each lift was recorded before they took the strength development class. After 
completing the class, the amount of weight they could each lift was again measured. The data are as follows: 



Weight (in pounds) 


Player 1 


Player 2 


Player 3 


Player 4 


Amount of weight lifted prior to the class 


205 


241 


338 


368 


Amount of weight lifted after the class 


295 


252 


330 


360 



Table 10.14 



The coach wants to know if the strength development class makes his players stronger, on average. 

Record the differences data. Calculate the differences by subtracting the amount of weight lifted prior to the class 
from the weight lifted after completing the class. The data for the differences are: {90, 11, -8, -8}. Assume the 
differences have a normal distribution. 

Using the differences data, calculate the sample mean and the sample standard deviation. 
x j = 21.3, sd = 46.7 



NOTE 

The data given here would indicate that the distribution is actually right-skewed. The difference 90 may be 
an extreme outlier? ft is pulling the sample mean to be 21.3 (positive). The means of the other three data 
values are actually negative. 



Using the difference data, this becomes a test of a single 



(fill in the blank). 



This content is available for free at https://cnx.Org/content/colll562/l.17 





CHAPTER 10 | HYPOTHESIS TESTING WITH TWO SAMPLES 



549 



Define the random variable: X d mean difference in the maximum lift per player. 

The distribution for the hypothesis test is t 3 . 

Ho: pd < 0, Ha : pd > 0 

Graph: 



p-value = 0.2150 

“ 

0 21.3 

Figure 10.11 

Calculate the p-value: The p-value is 0.2150 

Decision: If the level of significance is 5%, the decision is not to reject the null hypothesis, because a < p-value. 

What is the conclusion? 

At a 5% level of significance, from the sample data, there is not sufficient evidence to conclude that the strength 
development class helped to make the players stronger, on average. 




Try It 



10.12 A new prep class was designed to improve SAT test scores. Five students were selected at random. Their scores 
on two practice exams were recorded, one before the class and one after. The data recorded in Table 10.15. Are the 
scores, on average, higher after the class? Test at a 5% level. 



SAT Scores 


Student 1 


Student 2 


Student 3 


Student 4 


Score before class 


1840 


1960 


1920 


2150 


Score after class 


1920 


2160 


2200 


2100 



Table 10.15 



Example 10.13 



Seven eighth graders at Kennedy Middle School measured how far they could push the shot-put with their 
dominant (writing) hand and their weaker (non-writing) hand. They thought that they could push equal distances 
with either hand. The data were collected and recorded in Table 10.16. 



Distance (in feet) 
using 


Student 

1 


Student 

2 


Student 

3 


Student 

4 


Student 

5 


Student 

6 


Student 

7 


Dominant Hand 


30 


26 


34 


17 


19 


26 


20 



Table 10.16 
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Distance (in feet) 
using 


Student 

1 


Student 

2 


Student 

3 


Student 

4 


Student 

5 


Student 

6 


Student 

7 


Weaker Hand 


28 


14 


27 


18 


17 


26 


16 



Table 10.16 



Conduct a hypothesis test to determine whether the mean difference in distances between the children’s dominant 
versus weaker hands is significant. 

Record the differences data. Calculate the differences by subtracting the distances with the weaker hand from the 
distances with the dominant hand. The data for the differences are: {2, 12, 7, -1, 2, 0, 4}. The differences have a 
normal distribution. 

Using the differences data, calculate the sample mean and the sample standard deviation, x j = 3.71, s d = 4.5. 

Random variable: X d = mean difference in the distances between the hands. 

Distribution for the hypothesis test: t6 

Ho: pd = 0 H a : pd * 0 

Graph: 




Figure 10.12 

Calculate the p-value: The p-value is 0.0716 (using the data directly). 

(test statistic = 2.18. p-value = 0.0719 using (x d = 3.71, s d = 4.5. j 

Decision: Assume a = 0.05. Since a < p-value, Do not reject Ho. 

Conclusion: At the 5% level of significance, from the sample data, there is not sufficient evidence to conclude 
that there is a difference in the children’s weaker and dominant hands to push the shot-put. 
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10.13 Five ball players think they can throw the same distance with their dominant hand (throwing) and off-hand 
(catching hand). The data were collected and recorded in Table 10.17. Conduct a hypothesis test to determine whether 
the mean difference in distances between the dominant and off-hand is significant. Test at the 5% level. 





Player 1 


Player 2 


Player 3 


Player 4 


Player 5 


Dominant Hand 


120 


111 


135 


140 


125 


Off-hand 


105 


109 


98 


111 


99 



Table 10.17 



10.5 | Hypothesis Testing for Two Means and Two 
Proportions 
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Stats Lab 



10.1 Hypothesis Testing for Two Means and Two 
Proportions 

Class Time: 

Names: 

Student Learning Outcomes 

• The student will select the appropriate distributions to use in each case. 

• The student will conduct hypothesis tests and interpret the results. 

Supplies: 

• the business section from two consecutive days’ newspapers 

• three small packages of M&Ms® 

• five small packages of Reese's Pieces® 

Increasing Stocks Survey 

Look at yesterday’s newspaper business section. Conduct a hypothesis test to determine if the proportion of New York 
Stock Exchange (NYSE) stocks that increased is greater than the proportion of NASDAQ stocks that increased. As 
randomly as possible, choose 40 NYSE stocks, and 32 NASDAQ stocks and complete the following statements. 

1. Ho: 

2. H a : 

3. In words, define the random variable. 

4. The distribution to use for the test is . 

5. Calculate the test statistic using your data. 

6. Draw a graph and label it appropriately. Shade the actual level of significance. 

a. Graph: 



Figure 10.13 

b. Calculate the p-value. 

7. Do you reject or not reject the null hypothesis? Why? 

8. Write a clear conclusion using a complete sentence. 
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Decreasing Stocks Survey 

Randomly pick eight stocks from the newspaper. Using two consecutive days’ business sections, test whether the 
stocks went down, on average, for the second day. 

1. Ho: 

2. H a : 

3. In words, define the random variable. 

4. The distribution to use for the test is . 

5. Calculate the test statistic using your data. 

6. Draw a graph and label it appropriately. Shade the actual level of significance. 

a. Graph: 



Figure 10.14 

b. Calculate the p-value: 

7. Do you reject or not reject the null hypothesis? Why? 

8. Write a clear conclusion using a complete sentence. 

Candy Survey 

Buy three small packages of M&Ms and five small packages of Reese's Pieces (same net weight as the M&Ms). Test 
whether or not the mean number of candy pieces per package is the same for the two brands. 

1. Ho: 

2. H a : 

3. In words, define the random variable. 

4. What distribution should be used for this test? 

5. Calculate the test statistic using your data. 

6. Draw a graph and label it appropriately. Shade the actual level of significance. 

a. Graph: 
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Figure 10.15 

b. Calculate the p-value. 

7. Do you reject or not reject the null hypothesis? Why? 

8. Write a clear conclusion using a complete sentence. 

Shoe Survey 

Test whether women have, on average, more pairs of shoes than men. Include all forms of sneakers, shoes, sandals, 
and boots. Use your class as the sample. 

1. Ho: 

2. H a : 

3. In words, define the random variable. 

4. The distribution to use for the test is . 

5. Calculate the test statistic using your data. 

6. Draw a graph and label it appropriately. Shade the actual level of significance. 

a. Graph: 



Figure 10.16 

b. Calculate the p-value. 

7. Do you reject or not reject the null hypothesis? Why? 

8. Write a clear conclusion using a complete sentence. 



This content is available for free at https://cnx.Org/content/colll562/l.17 



CHAPTER 10 | HYPOTHESIS TESTING WITH TWO SAMPLES 555 



KEY TERMS 

Degrees of Freedom (df) the number of objects in a sample that are free to vary. 

Pooled Proportion estimate of the common value of pi and p 2 . 

Standard Deviation A number that is equal to the square root of the variance and measures how far data values are 
from their mean; notation: s for sample standard deviation and a for population standard deviation. 

Variable (Random Variable) a characteristic of interest in a population being studied. Common notation for variables 
are upper-case Latin letters X, Y, Z,... Common notation for a specific value from the domain (set of all possible 
values of a variable) are lower-case Latin letters x, y, z,.... For example, if X is the number of children in a family, 
then x represents a specific integer 0, 1, 2, 3, .... Variables in statistics differ from variables in intermediate algebra 
in two following ways. 

• The domain of the random variable (RV) is not necessarily a numerical set; the domain may be expressed in 
words; for example, if X = hair color, then the domain is {black, blond, gray, green, orange}. 

• We can tell what specific value x of the random variable X takes only after performing the experiment. 

CHAPTER REVIEW 

10.1 Two Population Means with Unknown Standard Deviations 

Two population means from independent samples where the population standard deviations are not known 

• Random Variable: X j — X 2 = the difference of the sampling means 

• Distribution: Student's t-distribution with degrees of freedom (variances not pooled) 

10.2 Two Population Means with Known Standard Deviations 

A hypothesis test of two population means from independent samples where the population standard deviations are known 
(typically approximated with the sample standard deviations), will have these characteristics: 

• Random variable: X j — X 0 = the difference of the means 

• Distribution: normal distribution 

10.3 Comparing Two Independent Population Proportions 

Test of two population proportions from independent samples. 

A A 

• Random variable: Pa~Pb~ difference between the two estimated proportions 

• Distribution: normal distribution 

10.4 Matched or Paired Samples 

A hypothesis test for matched or paired samples (t-test) has these characteristics: 

• Test the differences by subtracting one measurement from the other measurement 

• Random Variable: x d = mean of the differences 

• Distribution: Student’s-t distribution with n - 1 degrees of freedom 

• If the number of differences is small (less than 30), the differences must follow a normal distribution. 

• Two samples are drawn from the same set of objects. 
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Samples are dependent. 



FORMULA REVIEW 



10.1 Two Population Means with Unknown 
Standard Deviations 



Standard error: SE = — I- 



Test statistic (t-score): t 



n j 1 n 2 

= ( x 1 ~ x 2 )~(Vi ~P 2 ) 

Gj ) 2 U 2 ) 2 
«1 + "2 



Degrees of freedom: 

df 



(p) 2 (* 2 ) 2 

n i ' rtn 



(^r)f^r) + (^)l 



<s 2 y 



- 



where: 

si and S 2 are the sample standard deviations, and ni and n 2 
are the sample sizes. 

x i and x -> are the sample means. 

Cohen’s d is the measure of effect size: 



d= v~ X2 

^ pooled 



>hn l -l)sl+(n 2 -l)sl 

where s nnnl ,j = : 1 = - 

P° oled | n l +n 2 - 2 



10.2 Two Population Means with Known 
Standard Deviations 

Normal Distribution: 



X x -X 2 ~N 



>l) 2 , (<d) 2 

Pi - Pi’ 11 — m 1 



'1 



Generally pi -pi = 0. 

Test Statistic (z-score): 

7 = {x t - x 2 ) - (p, -p 2 ) 

( ff l) 2 (tr 2 ) 2 



Generally pi - p 2 = 0. 

where: 

cri and 02 are the known population standard deviations, ni 
and n 2 are the sample sizes, x 1 and x 2 are the sample 
means, pi and p 2 are the population means. 

10.3 Comparing Two Independent Population 
Proportions 

Pooled Proportion: p c = Xp 

n F + n M 

Distribution for the differences: 



p' a- p'b~ 4°’ i Pc(] - + i) 



where the null hypothesis is Ho : pa = PB or Ho : pa - PB 

= 0. 



Test Statistic (z-score): z — 



ip' A~ P' b) 



Pc0-~ Pc)(wx + wg) 



where the null hypothesis is Ho: pa = PB or Ho: pa ~ PB 

= 0 . 

where 

p'A and p'B are the sample proportions, pa and pb are the 
population proportions, 

P c is the pooled proportion, and ha and he are the sample 
sizes. 

10.4 Matched or Paired Samples 

x d~ Pd 



Test Statistic (t-score): t : 



where: 



(V) 

Vvn/ 



x d is the mean of the sample differences, pd is the mean 

of the population differences, sd is the sample standard 
deviation of the differences, n is the sample size. 



PRACTICE 



10.1 Two Population Means with Unknown Standard Deviations 

Use the following information to answer the next 15 exercises: Indicate if the hypothesis test is for 

a. independent group means, population standard deviations, and/or variances known 

b. independent group means, population standard deviations, and/or variances unknown 
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c. matched or paired samples 

d. single mean 

e. two proportions 

f. single proportion 

1. It is believed that 70% of males pass their drivers test in the first attempt, while 65% of females pass the test in the first 
attempt. Of interest is whether the proportions are in fact equal. 

2 . A new laundry detergent is tested on consumers. Of interest is the proportion of consumers who prefer the new brand 
over the leading competitor. A study is done to test this. 

3 . A new windshield treatment claims to repel water more effectively. Ten windshields are tested by simulating rain without 
the new treatment. The same windshields are then treated, and the experiment is run again. A hypothesis test is conducted. 

4 . The known standard deviation in salary for all mid-level professionals in the financial industry is $11,000. Company A 
and Company B are in the financial industry. Suppose samples are taken of mid-level professionals from Company A and 
from Company B. The sample mean salary for mid-level professionals in Company A is $80,000. The sample mean salary 
for mid-level professionals in Company B is $96,000. Company A and Company B management want to know if their mid- 
level professionals are paid differently, on average. 

5 . The average worker in Germany gets eight weeks of paid vacation. 

6 . According to a television commercial, 80% of dentists agree that Ultrafresh toothpaste is the best on the market. 

7 . It is believed that the average grade on an English essay in a particular school system for females is higher than for males. 
A random sample of 31 females had a mean score of 82 with a standard deviation of three, and a random sample of 25 males 
had a mean score of 76 with a standard deviation of four. 

8. The league mean batting average is 0.280 with a known standard deviation of 0.06. The Rattlers and the Vikings belong 
to the league. The mean batting average for a sample of eight Rattlers is 0.210, and the mean batting average for a sample 
of eight Vikings is 0.260. There are 24 players on the Rattlers and 19 players on the Vikings. Are the batting averages of the 
Rattlers and Vikings statistically different? 

9 . In a random sample of 100 forests in the United States, 56 were coniferous or contained conifers. In a random sample of 
80 forests in Mexico, 40 were coniferous or contained conifers. Is the proportion of conifers in the United States statistically 
more than the proportion of conifers in Mexico? 

10. A new medicine is said to help improve sleep. Eight subjects are picked at random and given the medicine. The means 
hours slept for each person were recorded before starting the medication and after. 

11 . It is thought that teenagers sleep more than adults on average. A study is done to verify this. A sample of 16 teenagers 
has a mean of 8.9 hours slept and a standard deviation of 1.2. A sample of 12 adults has a mean of 6.9 hours slept and a 
standard deviation of 0.6. 

12 . Varsity athletes practice five times a week, on average. 

13 . A sample of 12 in-state graduate school programs at school A has a mean tuition of $64,000 with a standard deviation of 
$8,000. At school B, a sample of 16 in-state graduate programs has a mean of $80,000 with a standard deviation of $6,000. 
On average, are the mean tuitions different? 

14 . A new WiFi range booster is being offered to consumers. A researcher tests the native range of 12 different routers 
under the same conditions. The ranges are recorded. Then the researcher uses the new WiFi range booster and records the 
new ranges. Does the new WiFi range booster do a better job? 

15 . A high school principal claims that 30% of student athletes drive themselves to school, while 4% of non-athletes drive 
themselves to school. In a sample of 20 student athletes, 45% drive themselves to school. In a sample of 35 non-athlete 
students, 6% drive themselves to school. Is the percent of student athletes who drive themselves to school more than the 
percent of nonathletes? 

Use the following information to answer the next three exercises: A study is done to determine which of two soft drinks 
has more sugar. There are 13 cans of Beverage A in a sample and six cans of Beverage B. The mean amount of sugar in 
Beverage A is 36 grams with a standard deviation of 0.6 grams. The mean amount of sugar in Beverage B is 38 grams with 
a standard deviation of 0.8 grams. The researchers believe that Beverage B has more sugar than Beverage A, on average. 
Both populations have normal distributions. 

16 . Are standard deviations known or unknown? 

17 . What is the random variable? 

18 . Is this a one-tailed or two-tailed test? 
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Use the following information to answer the next 12 exercises: The U.S. Center for Disease Control reports that the mean 
life expectancy was 47.6 years for whites born in 1900 and 33.0 years for nonwhites. Suppose that you randomly survey 
death records for people born in 1900 in a certain county. Of the 124 whites, the mean life span was 45.3 years with a 
standard deviation of 12.7 years. Of the 82 nonwhites, the mean life span was 34.1 years with a standard deviation of 15.6 
years. Conduct a hypothesis test to see if the mean life spans in the county were the same for whites and nonwhites. 

19. Is this a test of means or proportions? 

20. State the null and alternative hypotheses. 

a. Ho: 

b. H a : 

21. Is this a right-tailed, left-tailed, or two-tailed test? 

22. In symbols, what is the random variable of interest for this test? 

23. In words, define the random variable of interest for this test. 

24. Which distribution (normal or Student's t) would you use for this hypothesis test? 

25. Explain why you chose the distribution you did for Exercise 10.24. 

26. Calculate the test statistic and p-value. 

27. Sketch a graph of the situation. Label the horizontal axis. Mark the hypothesized difference and the sample difference. 
Shade the area corresponding to the p-value. 

28. Find the p-value. 

29. At a pre-conceived a = 0.05, what is your: 

a. Decision: 

b. Reason for the decision: 

c. Conclusion (write out in a complete sentence): 

30. Does it appear that the means are the same? Why or why not? 

10.2 Two Population Means with Known Standard Deviations 

Use the following information to answer the next five exercises. The mean speeds of fastball pitches from two different 
baseball pitchers are to be compared. A sample of 14 fastball pitches is measured from each pitcher. The populations have 
normal distributions. Table 10.18 shows the result. Scouters believe that Rodriguez pitches a speedier fastball. 



Pitcher 


Sample Mean Speed of Pitches (mph) 


Population Standard Deviation 


Wesley 


86 


3 


Rodriguez 


91 


7 



Table 10.18 



31. What is the random variable? 

32. State the null and alternative hypotheses. 

33. What is the test statistic? 

34. What is the p-value? 

35. At the 1% significance level, what is your conclusion? 

Use the following information to answer the next five exercises. A researcher is testing the effects of plant food on plant 
growth. Nine plants have been given the plant food. Another nine plants have not been given the plant food. The heights of 
the plants are recorded after eight weeks. The populations have normal distributions. The following table is the result. The 
researcher thinks the food makes the plants grow taller. 



Plant Group 


Sample Mean Height of Plants (inches) 


Population Standard Deviation 


Food 


16 


2.5 


No food 


14 


1.5 



Table 10.19 
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36. Is the population standard deviation known or unknown? 

37. State the null and alternative hypotheses. 

38. What is the p-value? 

39. Draw the graph of the p-value. 

40. At the 1% significance level, what is your conclusion? 

Use the following information to answer the next five exercises. Two metal alloys are being considered as material for 
ball bearings. The mean melting point of the two alloys is to be compared. 15 pieces of each metal are being tested. Both 
populations have normal distributions. The following table is the result. It is believed that Alloy Zeta has a different melting 
point. 





Sample Mean Melting Temperatures (°F) 


Population Standard Deviation 


Alloy Gamma 


800 


95 


Alloy Zeta 


900 


105 



Table 10.20 



41. State the null and alternative hypotheses. 

42. Is this a right-, left-, or two-tailed test? 

43. What is the p-value? 

44. Draw the graph of the p-value. 

45. At the 1% significance level, what is your conclusion? 

10.3 Comparing Two Independent Population Proportions 

Use the following information for the next five exercises. Two types of phone operating system are being tested to determine 
if there is a difference in the proportions of system failures (crashes). Fifteen out of a random sample of 150 phones with 

051 had system failures within the first eight hours of operation. Nine out of another random sample of 150 phones with 

05 2 had system failures within the first eight hours of operation. OS 2 is believed to be more stable (have fewer crashes) 
than OSi. 

46. Is this a test of means or proportions? 

47. What is the random variable? 

48. State the null and alternative hypotheses. 

49. What is the p-value? 

50. What can you conclude about the two operating systems? 

Use the following information to answer the next twelve exercises. In the recent Census, three percent of the U.S. population 
reported being of two or more races. However, the percent varies tremendously from state to state. Suppose that two random 
surveys are conducted. In the first random survey, out of 1,000 North Dakotans, only nine people reported being of two or 
more races. In the second random survey, out of 500 Nevadans, 17 people reported being of two or more races. Conduct 
a hypothesis test to determine if the population percents are the same for the two states or if the percent for Nevada is 
statistically higher than for North Dakota. 

51. Is this a test of means or proportions? 

52. State the null and alternative hypotheses. 

a. Ho: 

b. H a : 

53. Is this a right-tailed, left-tailed, or two-tailed test? How do you know? 

54. What is the random variable of interest for this test? 

55. In words, define the random variable for this test. 

56. Which distribution (normal or Student's t) would you use for this hypothesis test? 

57. Explain why you chose the distribution you did for the Exercise 10.56. 



560 CHAPTER 10 | HYPOTHESIS TESTING WITH TWO SAMPLES 



58. Calculate the test statistic. 

59. Sketch a graph of the situation. Mark the hypothesized difference and the sample difference. Shade the area 
corresponding to the p-value. 



P'n~ P'nd 



Figure 10.17 

60. Find the p-value. 

61. At a pre-conceived a = 0.05, what is your: 

a. Decision: 

b. Reason for the decision: 

c. Conclusion (write out in a complete sentence): 

62. Does it appear that the proportion of Nevadans who are two or more races is higher than the proportion of North 
Dakotans? Why or why not? 

10.4 Matched or Paired Samples 

Use the following information to answer the next five exercises. A study was conducted to test the effectiveness of a software 
patch in reducing system failures over a six-month period. Results for randomly selected installations are shown in Table 
10.21. The “before” value is matched to an “after” value, and the differences are calculated. The differences have a normal 
distribution. Test at the 1% significance level. 



Installation 


A 


B 


C 


D 


E 


F 


G 


H 


Before 


3 


6 


4 


2 


5 


8 


2 


6 


After 


1 


5 


2 


0 


1 


0 


2 


2 



Table 10.21 



63. What is the random variable? 

64. State the null and alternative hypotheses. 

65. What is the p-value? 

66. Draw the graph of the p-value. 

67. What conclusion can you draw about the software patch? 



Use the following information to answer next five exercises. A study was conducted to test the effectiveness of a juggling 
class. Before the class started, six subjects juggled as many balls as they could at once. After the class, the same six subjects 
juggled as many balls as they could. The differences in the number of balls are calculated. The differences have a normal 
distribution. Test at the 1% significance level. 



Subject 


A 


B 


C 


D 


E 


F 


Before 


3 


4 


3 


2 


4 


5 


After 


4 


5 


6 


4 


5 


7 



Table 10.22 



68. State the null and alternative hypotheses. 

69. What is the p-value? 

70. What is the sample mean difference? 

71. Draw the graph of the p-value. 

72. What conclusion can you draw about the juggling class? 
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Use the following information to answer the next five exercises. A doctor wants to know if a blood pressure medication is 
effective. Six subjects have their blood pressures recorded. After twelve weeks on the medication, the same six subjects 
have their blood pressure recorded again. For this test, only systolic pressure is of concern. Test at the 1% significance level. 



Patient 


A 


B 


C 


D 


E 


F 


Before 


161 


162 


165 


162 


166 


171 


After 


158 


159 


166 


160 


167 


169 



Table 10.23 



73. State the null and alternative hypotheses. 

74. What is the test statistic? 

75. What is the p-value? 

76. What is the sample mean difference? 

77. What is the conclusion? 

HOMEWORK 

10.1 Two Population Means with Unknown Standard Deviations 

DIRECTIONS: For each of the word problems, use a solution sheet to do the hypothesis test. The solution sheet is found in 
Appendix E. Please feel free to make copies of the solution sheets. For the online version of the book, it is suggested that 
you copy the .doc or the .pdf files. 



NOTE 

If you are using a Student's t-distribution for a homework problem in what follows, including for paired data, you may 
assume that the underlying population is normally distributed. (When using these tests in a real situation, you must 
first prove that assumption, however.) 



78. The mean number of English courses taken in a two-year time period by male and female college students is believed 
to be about the same. An experiment is conducted and data are collected from 29 males and 16 females. The males took an 
average of three English courses with a standard deviation of 0.8. The females took an average of four English courses with 
a standard deviation of 1.0. Are the means statistically the same? 

79. A student at a four-year college claims that mean enrollment at four-year colleges is higher than at two-year colleges 
in the United States. Two surveys are conducted. Of the 35 two-year colleges surveyed, the mean enrollment was 5,068 
with a standard deviation of 4,777. Of the 35 four-year colleges surveyed, the mean enrollment was 5,466 with a standard 
deviation of 8,191. 

80. At Rachel’s 11 th birthday party, eight girls were timed to see how long (in seconds) they could hold their breath in a 
relaxed position. After a two-minute rest, they timed themselves while jumping. The girls thought that the mean difference 
between their jumping and relaxed times would be zero. Test their hypothesis. 



Relaxed time (seconds) 


Jumping time (seconds) 


26 


21 


47 


40 


30 


28 


22 


21 



Table 10.24 
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Relaxed time (seconds) 


Jumping time (seconds) 


23 


25 


45 


43 


37 


35 


29 


32 



Table 10.24 



81. Mean entry-level salaries for college graduates with mechanical engineering degrees and electrical engineering degrees 
are believed to be approximately the same. A recruiting office thinks that the mean mechanical engineering salary is 
actually lower than the mean electrical engineering salary. The recruiting office randomly surveys 50 entry level mechanical 
engineers and 60 entry level electrical engineers. Their mean salaries were $46,100 and $46,700, respectively. Their 
standard deviations were $3,450 and $4,210, respectively. Conduct a hypothesis test to determine if you agree that the mean 
entry-level mechanical engineering salary is lower than the mean entry-level electrical engineering salary. 

82. Marketing companies have collected data implying that teenage girls use more ring tones on their cellular phones than 
teenage boys do. In one particular study of 40 randomly chosen teenage girls and boys (20 of each) with cellular phones, 
the mean number of ring tones for the girls was 3.2 with a standard deviation of 1.5. The mean for the boys was 1.7 with 
a standard deviation of 0.8. Conduct a hypothesis test to determine if the means are approximately the same or if the girls’ 
mean is higher than the boys’ mean. 

Use the information from Appendix C to answer the next four exercises. 

83. Using the data from Lap 1 only, conduct a hypothesis test to determine if the mean time for completing a lap in races is 
the same as it is in practices. 

84. Repeat the test in Exercise 10.83, but use Lap 5 data this time. 

85. Repeat the test in Exercise 10.83, but this time combine the data from Laps 1 and 5. 

86. In two to three complete sentences, explain in detail how you might use Terri Vogel’s data to answer the following 
question. “Does Terri Vogel drive faster in races than she does in practices?” 

Use the following information to answer the next two exercises. The Eastern and Western Major League Soccer conferences 
have a new Reserve Division that allows new players to develop their skills. Data for a randomly picked date showed the 
following annual goals. 



Western 


Eastern 


Los Angeles 9 


D.C. United 9 


FC Dallas 3 


Chicago 8 


Chivas USA 4 


Columbus 7 


Real Salt Lake 3 


New England 6 


Colorado 4 


MetroStars 5 


San Jose 4 


Kansas City 3 



Table 10.25 



Conduct a hypothesis test to answer the next two exercises. 

87. The exact distribution for the hypothesis test is: 

a. the normal distribution 

b. the Student's t-distribution 

c. the uniform distribution 

d . the exponential distribution 

88. If the level of significance is 0.05, the conclusion is: 

a. There is sufficient evidence to conclude that the W Division teams score fewer goals, on average, than the E 
teams 

b. There is insufficient evidence to conclude that the W Division teams score more goals, on average, than the E 
teams. 
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c. There is insufficient evidence to conclude that the W teams score fewer goals, on average, than the E teams score. 

d . Unable to determine 

89. Suppose a statistics instructor believes that there is no significant difference between the mean class scores of statistics 
day students on Exam 2 and statistics night students on Exam 2. She takes random samples from each of the populations. 
The mean and standard deviation for 35 statistics day students were 75.86 and 16.91. The mean and standard deviation 
for 37 statistics night students were 75.41 and 19.73. The “day” subscript refers to the statistics day students. The “night” 
subscript refers to the statistics night students. A concluding statement is: 

a. There is sufficient evidence to conclude that statistics night students' mean on Exam 2 is better than the statistics 
day students' mean on Exam 2. 

b. There is insufficient evidence to conclude that the statistics day students' mean on Exam 2 is better than the 
statistics night students' mean on Exam 2. 

c. There is insufficient evidence to conclude that there is a significant difference between the means of the statistics 
day students and night students on Exam 2. 

d. There is sufficient evidence to conclude that there is a significant difference between the means of the statistics 
day students and night students on Exam 2. 

90. Researchers interviewed street prostitutes in Canada and the United States. The mean age of the 100 Canadian 
prostitutes upon entering prostitution was 18 with a standard deviation of six. The mean age of the 130 United States 
prostitutes upon entering prostitution was 20 with a standard deviation of eight. Is the mean age of entering prostitution in 
Canada lower than the mean age in the United States? Test at a 1% significance level. 

91. A powder diet is tested on 49 people, and a liquid diet is tested on 36 different people. Of interest is whether the liquid 
diet yields a higher mean weight loss than the powder diet. The powder diet group had a mean weight loss of 42 pounds 
with a standard deviation of 12 pounds. The liquid diet group had a mean weight loss of 45 pounds with a standard deviation 
of 14 pounds. 

92. Suppose a statistics instructor believes that there is no significant difference between the mean class scores of statistics 
day students on Exam 2 and statistics night students on Exam 2. She takes random samples from each of the populations. 
The mean and standard deviation for 35 statistics day students were 75.86 and 16.91, respectively. The mean and standard 
deviation for 37 statistics night students were 75.41 and 19.73. The “day” subscript refers to the statistics day students. 
The “night” subscript refers to the statistics night students. An appropriate alternative hypothesis for the hypothesis test is: 

3. p day > Pnight 
b. pday < Pnight 
C. pday = Pnight 
d. pday ^ Pnight 

10.2 Two Population Means with Known Standard Deviations 

DIRECTIONS: For each of the word problems, use a solution sheet to do the hypothesis test. The solution sheet is found in 
Appendix E. Please feel free to make copies of the solution sheets. For the online version of the book, it is suggested that 
you copy the .doc or the .pdf files. 



NOTE 

If you are using a Student's t-distribution for one of the following homework problems, including for paired data, you 
may assume that the underlying population is normally distributed. (When using these tests in a real situation, you 
must first prove that assumption, however.) 



93. A study is done to determine if students in the California state university system take longer to graduate, on average, 
than students enrolled in private universities. One hundred students from both the California state university system and 
private universities are surveyed. Suppose that from years of research, it is known that the population standard deviations 
are 1.5811 years and 1 year, respectively. The following data are collected. The California state university system students 
took on average 4.5 years with a standard deviation of 0.8. The private university students took on average 4.1 years with a 
standard deviation of 0.3. 

94. Parents of teenage boys often complain that auto insurance costs more, on average, for teenage boys than for teenage 
girls. A group of concerned parents examines a random sample of insurance bills. The mean annual cost for 36 teenage boys 
was $679. For 23 teenage girls, it was $559. From past years, it is known that the population standard deviation for each 
group is $180. Determine whether or not you believe that the mean cost for auto insurance for teenage boys is greater than 
that for teenage girls. 
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95. A group of transfer bound students wondered if they will spend the same mean amount on texts and supplies each 
year at their four-year university as they have at their community college. They conducted a random survey of 54 students 
at their community college and 66 students at their local four-year university. The sample means were $947 and $1,011, 
respectively. The population standard deviations are known to be $254 and $87, respectively. Conduct a hypothesis test to 
determine if the means are statistically the same. 

96. Some manufacturers claim that non-hybrid sedan cars have a lower mean miles-per-gallon (mpg) than hybrid ones. 
Suppose that consumers test 21 hybrid sedans and get a mean of 31 mpg with a standard deviation of seven mpg. Thirty- 
one non-hybrid sedans get a mean of 22 mpg with a standard deviation of four mpg. Suppose that the population standard 
deviations are known to be six and three, respectively. Conduct a hypothesis test to evaluate the manufacturers claim. 

97. A baseball fan wanted to know if there is a difference between the number of games played in a World Series when 
the American League won the series versus when the National League won the series. From 1922 to 2012, the population 
standard deviation of games won by the American League was 1.14, and the population standard deviation of games won 
by the National League was 1.11. Of 19 randomly selected World Series games won by the American League, the mean 
number of games won was 5.76. The mean number of 17 randomly selected games won by the National League was 5.42. 
Conduct a hypothesis test. 

98. One of the questions in a study of marital satisfaction of dual-career couples was to rate the statement Tm pleased with 
the way we divide the responsibilities for childcare.” The ratings went from one (strongly agree) to five (strongly disagree). 
Table 10.26 contains ten of the paired responses for husbands and wives. Conduct a hypothesis test to see if the mean 
difference in the husband’s versus the wife’s satisfaction level is negative (meaning that, within the partnership, the husband 
is happier than the wife). 



Wife’s Score 


2 


2 


3 


3 


4 


2 


1 


1 


2 


4 


Husband’s Score 


2 


2 


1 


3 


2 


1 


1 


1 


2 


4 



Table 10.26 



10.3 Comparing Two Independent Population Proportions 

DIRECTIONS: For each of the word problems, use a solution sheet to do the hypothesis test. The solution sheet is found in 
Appendix E. Please feel free to make copies of the solution sheets. For the online version of the book, it is suggested that 
you copy the .doc or the .pdf files. 



NOTE 

If you are using a Student's t-distribution for one of the following homework problems, including for paired data, you 
may assume that the underlying population is normally distributed. (In general, you must first prove that assumption, 
however.) 



99. A recent drug survey showed an increase in the use of drugs and alcohol among local high school seniors as compared to 
the national percent. Suppose that a survey of 100 local seniors and 100 national seniors is conducted to see if the proportion 
of drug and alcohol use is higher locally than nationally. Locally, 65 seniors reported using drugs or alcohol within the past 
month, while 60 national seniors reported using them. 

100. We are interested in whether the proportions of female suicide victims for ages 15 to 24 are the same for the whites 
and the blacks races in the United States. We randomly pick one year, 1992, to compare the races. The number of suicides 
estimated in the United States in 1992 for white females is 4,930. Five hundred eighty were aged 15 to 24. The estimate for 
black females is 330. Forty were aged 15 to 24. We will let female suicide victims be our population. 



101. Elizabeth Mjelde, an art history professor, was interested in whether the value from the Golden Ratio formula, 
(larger + smaller dimension) 
larger dimension ) 



\ 



was the same in the Whitney Exhibit for works from 1900 to 1919 as for works from 1920 



to 1942. Thirty-seven early works were sampled, averaging 1.74 with a standard deviation of 0.11. Sixty-five of the later 
works were sampled, averaging 1.746 with a standard deviation of 0.1064. Do you think that there is a significant difference 
in the Golden Ratio calculation? 



102. A recent year was randomly picked from 1985 to the present. In that year, there were 2,051 Ffispanic students at 
Cabrillo College out of a total of 12,328 students. At Lake Tahoe College, there were 321 Flispanic students out of a total 
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of 2,441 students. In general, do you think that the percent of Hispanic students at the two colleges is basically the same or 
different? 

Use the following information to answer the next three exercises. Neuroinvasive West Nile virus is a severe disease that 
affects a person’s nervous system . It is spread by the Culex species of mosquito. In the United States in 2010 there were 
629 reported cases of neuroinvasive West Nile virus out of a total of 1,021 reported cases and there were 486 neuroinvasive 
reported cases out of a total of 712 cases reported in 2011. Is the 2011 proportion of neuroinvasive West Nile virus cases 
more than the 2010 proportion of neuroinvasive West Nile virus cases? Using a 1% level of significance, conduct an 
appropriate hypothesis test. 

• “2011” subscript: 2011 group. 

• “2010” subscript: 2010 group 

103. This is: 

a. a test of two proportions 

b. a test of two independent means 

c. a test of a single mean 

d. a test of matched pairs. 

104. An appropriate null hypothesis is: 

a. P2011 < P2010 

b. P2011 ^ P2010 
C. P2011 < P2010 
d. P2011 > P2010 

105. The p-value is 0.0022. At a 1% level of significance, the appropriate conclusion is 

a. There is sufficient evidence to conclude that the proportion of people in the United States in 2011 who contracted 
neuroinvasive West Nile disease is less than the proportion of people in the United States in 2010 who contracted 
neuroinvasive West Nile disease. 

b. There is insufficient evidence to conclude that the proportion of people in the United States in 2011 who 
contracted neuroinvasive West Nile disease is more than the proportion of people in the United States in 2010 
who contracted neuroinvasive West Nile disease. 

c. There is insufficient evidence to conclude that the proportion of people in the United States in 2011 who 
contracted neuroinvasive West Nile disease is less than the proportion of people in the United States in 2010 who 
contracted neuroinvasive West Nile disease. 

d. There is sufficient evidence to conclude that the proportion of people in the United States in 2011 who contracted 
neuroinvasive West Nile disease is more than the proportion of people in the United States in 2010 who contracted 
neuroinvasive West Nile disease. 

106. Researchers conducted a study to find out if there is a difference in the use of eReaders by different age groups. 
Randomly selected participants were divided into two age groups. In the 16- to 29-year-old group, 7% of the 628 surveyed 
use eReaders, while 11% of the 2,309 participants 30 years old and older use eReaders. 

107. Adults aged 18 years old and older were randomly selected for a survey on obesity. Adults are considered obese if 
their body mass index (BMI) is at least 30. The researchers wanted to determine if the proportion of women who are obese 
in the south is less than the proportion of southern men who are obese. The results are shown in Table 10.27. Test at the 
1% level of significance. 





Number who are obese 


Sample size 


Men 


42,769 


155,525 


Women 


67,169 


248,775 



Table 10.27 



108. Two computer users were discussing tablet computers. A higher proportion of people ages 16 to 29 use tablets than 
the proportion of people age 30 and older. Table 10.28 details the number of tablet owners for each age group. Test at the 
1% level of significance. 





16-29 year olds 


30 years old and older 


Own a Tablet 


69 


231 



Table 10.28 
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16-29 year olds 


30 years old and older 


Sample Size 


628 


2,309 



Table 10.28 



109. A group of friends debated whether more men use smartphones than women. They consulted a research study 
of smartphone use among adults. The results of the survey indicate that of the 973 men randomly sampled, 379 use 
smartphones. For women, 404 of the 1,304 who were randomly sampled use smartphones. Test at the 5% level of 
significance. 

110. While her husband spent 2Vi hours picking out new speakers, a statistician decided to determine whether the percent of 
men who enjoy shopping for electronic equipment is higher than the percent of women who enjoy shopping for electronic 
equipment. The population was Saturday afternoon shoppers. Out of 67 men, 24 said they enjoyed the activity. Eight of the 
24 women surveyed claimed to enjoy the activity. Interpret the results of the survey. 

111. We are interested in whether children’s educational computer software costs less, on average, than children’s 
entertainment software. Thirty-six educational software titles were randomly picked from a catalog. The mean cost was 
$31.14 with a standard deviation of $4.69. Thirty-five entertainment software titles were randomly picked from the same 
catalog. The mean cost was $33.86 with a standard deviation of $10.87. Decide whether children’s educational software 
costs less, on average, than children’s entertainment software. 

112. Joan Nguyen recently claimed that the proportion of college-age males with at least one pierced ear is as high as the 
proportion of college-age females. She conducted a survey in her classes. Out of 107 males, 20 had at least one pierced ear. 
Out of 92 females, 47 had at least one pierced ear. Do you believe that the proportion of males has reached the proportion 
of females? 

113. Use the data sets found in Appendix C to answer this exercise. Is the proportion of race laps Terri completes slower 
than 130 seconds less than the proportion of practice laps she completes slower than 135 seconds? 

114. "To Breakfast or Not to Breakfast?" by Richard Ayore 

In the American society, birthdays are one of those days that everyone looks forward to. People of different ages and peer 
groups gather to mark the 18th, 20th, . . ., birthdays. During this time, one looks back to see what he or she has achieved for 
the past year and also focuses ahead for more to come. 

If, by any chance, I am invited to one of these parties, my experience is always different. Instead of dancing around with 
my friends while the music is booming, I get carried away by memories of my family back home in Kenya. I remember the 
good times I had with my brothers and sister while we did our daily routine. 

Every morning, I remember we went to the shamba (garden) to weed our crops. I remember one day arguing with my 
brother as to why he always remained behind just to join us an hour later. In his defense, he said that he preferred waiting 
for breakfast before he came to weed. He said, “This is why I always work more hours than you guys!” 

And so, to prove him wrong or right, we decided to give it a try. One day we went to work as usual without breakfast, 
and recorded the time we could work before getting tired and stopping. On the next day, we all ate breakfast before going 
to work. We recorded how long we worked again before getting tired and stopping. Of interest was our mean increase in 
work time. Though not sure, my brother insisted that it was more than two hours. Using the data in Table 10.29, solve our 
problem. 



Work hours with breakfast 


Work hours without breakfast 


8 


6 


7 


5 


9 


5 


5 


4 


9 


7 


8 


7 


10 


7 


7 


5 



Table 10.29 
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Work hours with breakfast 


Work hours without breakfast 


6 


6 


9 


5 



Table 10.29 



10.4 Matched or Paired Samples 

DIRECTIONS: For each of the word problems, use a solution sheet to do the hypothesis test. The solution sheet is found in 
Appendix E. Please feel free to make copies of the solution sheets. For the online version of the book, it is suggested that 
you copy the .doc or the .pdf files. 



NOTE 

If you are using a Student's t-distribution for the homework problems, including for paired data, you may assume that 
the underlying population is normally distributed. (When using these tests in a real situation, you must first prove that 
assumption, however.) 



115. Ten individuals went on a low-fat diet for 12 weeks to lower their cholesterol. The data are recorded in Table 10.30. 
Do you think that their cholesterol levels were significantly lowered? 



Starting cholesterol level 


Ending cholesterol level 


140 


140 


220 


230 


110 


120 


240 


220 


200 


190 


180 


150 


190 


200 


360 


300 


280 


300 


260 


240 



Table 10.30 



Use the following information to answer the next two exercises. A new AIDS prevention drug was tried on a group of 224 
HIV positive patients. Forty-five patients developed AIDS after four years. In a control group of 224 HIV positive patients, 
68 developed AIDS after four years. We want to test whether the method of treatment reduces the proportion of patients that 
develop AIDS after four years or if the proportions of the treated group and the untreated group stay the same. 

Let the subscript t = treated patient and ut = untreated patient. 

116. The appropriate hypotheses are: 

a. Ho: pt < Put and H a : p t > put 

b . Ho: p t < pm and H a :pt> Put 

c. Ho: pt = Put and H a : pt * Put 

d . Ho: pt = Put and H a :pt< Put 

117. If the p-value is 0.0062 what is the conclusion (use a = 0.05)? 

a. The method has no effect. 

b. There is sufficient evidence to conclude that the method reduces the proportion of HIV positive patients who 
develop AIDS after four years. 
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c. There is sufficient evidence to conclude that the method increases the proportion of HIV positive patients who 
develop AIDS after four years. 

d. There is insufficient evidence to conclude that the method reduces the proportion of HIV positive patients who 
develop AIDS after four years. 

Use the following information to answer the next two exercises. An experiment is conducted to show that blood pressure 
can be consciously reduced in people trained in a “biofeedback exercise program.” Six subjects were randomly selected 
and blood pressure measurements were recorded before and after the training. The difference between blood pressures was 

calculated (after - before) producing the following results: x j = -10.2 sd = 8.4. Using the data, test the hypothesis that the 

blood pressure has decreased after the training. 

118. The distribution for the test is: 

a. ts 

b. ts 

c. N{- 10.2,8.4) 

d. N(-10.2, 

V6 

119. If a = 0.05, the p-value and the conclusion are 

a. 0.0014; There is sufficient evidence to conclude that the blood pressure decreased after the training. 

b. 0.0014; There is sufficient evidence to conclude that the blood pressure increased after the training. 

c. 0.0155; There is sufficient evidence to conclude that the blood pressure decreased after the training. 

d. 0.0155; There is sufficient evidence to conclude that the blood pressure increased after the training. 

120. A golf instructor is interested in determining if her new technique for improving players’ golf scores is effective. She 
takes four new students. She records their 18-hole scores before learning the technique and then after having taken her class. 
She conducts a hypothesis test. The data are as follows. 





Player 1 


Player 2 


Player 3 


Player 4 


Mean score before class 


83 


78 


93 


87 


Mean score after class 


80 


80 


86 


86 



Table 10.31 



The correct decision is: 

a. Reject Ho. 

b. Do not reject the Ho. 

121. A local cancer support group believes that the estimate for new female breast cancer cases in the south is higher in 
2013 than in 2012. The group compared the estimates of new female breast cancer cases by southern state in 2012 and in 
2013. The results are in Table 10.32. 



Southern States 


2012 


2013 


Alabama 


3,450 


3,720 


Arkansas 


2,150 


2,280 


Florida 


15,540 


15,710 


Georgia 


6,970 


7,310 


Kentucky 


3,160 


3,300 


Louisiana 


3,320 


3,630 


Mississippi 


1,990 


2,080 


North Carolina 


7,090 


7,430 


Oklahoma 


2,630 


2,690 



Table 10.32 
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Southern States 


2012 


2013 


South Carolina 


3,570 


3,580 


Tennessee 


4,680 


5,070 


Texas 


15,050 


14,980 


Virginia 


6,190 


6,280 



Table 10.32 



122. A traveler wanted to know if the prices of hotels are different in the ten cities that he visits the most often. The list 
of the cities with the corresponding hotel prices for his two favorite hotel chains is in Table 10.33. Test at the 1% level of 
significance. 



Cities 


Hyatt Regency prices in dollars 


Hilton prices in dollars 


Atlanta 


107 


169 


Boston 


358 


289 


Chicago 


209 


299 


Dallas 


209 


198 


Denver 


167 


169 


Indianapolis 


179 


214 


Los Angeles 


179 


169 


New York City 


625 


459 


Philadelphia 


179 


159 


Washington, DC 


245 


239 



Table 10.33 



123. A politician asked his staff to determine whether the underemployment rate in the northeast decreased from 2011 to 
2012. The results are in Table 10.34. 



Northeastern States 


2011 


2012 


Connecticut 


17.3 


16.4 


Delaware 


17.4 


13.7 


Maine 


19.3 


16.1 


Maryland 


16.0 


15.5 


Massachusetts 


17.6 


18.2 


New Hampshire 


15.4 


13.5 


New Jersey 


19.2 


18.7 


New York 


18.5 


18.7 


Ohio 


18.2 


18.8 


Pennsylvania 


16.5 


16.9 


Rhode Island 


20.7 


22.4 


Vermont 


14.7 


12.3 


West Virginia 


15.5 


17.3 



Table 10.34 
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BRINGING IT TOGETHER: HOMEWORK 

Use the following information to answer the next ten exercises, indicate which of the following choices best identifies the 
hypothesis test. 

a. independent group means, population standard deviations and/or variances known 

b. independent group means, population standard deviations and/or variances unknown 

c. matched or paired samples 

d. single mean 

e. two proportions 

f. single proportion 

124. A powder diet is tested on 49 people, and a liquid diet is tested on 36 different people. The population standard 
deviations are two pounds and three pounds, respectively. Of interest is whether the liquid diet yields a higher mean weight 
loss than the powder diet. 

125. A new chocolate bar is taste-tested on consumers. Of interest is whether the proportion of children who like the new 
chocolate bar is greater than the proportion of adults who like it. 

126. The mean number of English courses taken in a two-year time period by male and female college students is believed 
to be about the same. An experiment is conducted and data are collected from nine males and 16 females. 

127. A football league reported that the mean number of touchdowns per game was five. A study is done to determine if 
the mean number of touchdowns has decreased. 

128. A study is done to determine if students in the California state university system take longer to graduate than 
students enrolled in private universities. One hundred students from both the California state university system and private 
universities are surveyed. From years of research, it is known that the population standard deviations are 1.5811 years and 
one year, respectively. 

129. According to a YWCA Rape Crisis Center newsletter, 75% of rape victims know their attackers. A study is done to 
verify this. 

130. According to a recent study, U.S. companies have a mean maternity-leave of six weeks. 

131. A recent drug survey showed an increase in use of drugs and alcohol among local high school students as compared to 
the national percent. Suppose that a survey of 100 local youths and 100 national youths is conducted to see if the proportion 
of drug and alcohol use is higher locally than nationally. 

132. A new SAT study course is tested on 12 individuals. Pre-course and post-course scores are recorded. Of interest is the 
mean increase in SAT scores. The following data are collected: 



Pre-course score 


Post-course score 


1 


300 


960 


920 


1010 


1100 


840 


880 


1100 


1070 


1250 


1320 


860 


860 


1330 


1370 


790 


770 


990 


1040 


1110 


1200 



Table 10.35 
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Pre-course score 


Post-course score 


740 


850 



Table 10.35 



133. University of Michigan researchers reported in the Journal of the National Cancer Institute that quitting smoking is 
especially beneficial for those under age 49. In this American Cancer Society study, the risk (probability) of dying of lung 
cancer was about the same as for those who had never smoked. 

134. Lesley E. Tan investigated the relationship between left-handedness vs. right-handedness and motor competence in 
preschool children. Random samples of 41 left-handed preschool children and 41 right-handed preschool children were 
given several tests of motor skills to determine if there is evidence of a difference between the children based on this 
experiment. The experiment produced the means and standard deviations shown Table 10.36. Determine the appropriate 
test and best distribution to use for that test. 





Left-handed 


Right-handed 


Sample size 


41 


41 


Sample mean 


97.5 


98.1 


Sample standard deviation 


17.5 


19.2 



Table 10.36 



a. Two independent means, normal distribution 

b. Two independent means, Student’s-t distribution 

c. Matched or paired samples, Student’s-t distribution 

d . Two population proportions, normal distribution 

135. A golf instructor is interested in determining if her new technique for improving players’ golf scores is effective. She 
takes four (4) new students. She records their 18-hole scores before learning the technique and then after having taken her 
class. She conducts a hypothesis test. The data are as Table 10.37. 





Player 1 


Player 2 


Player 3 


Player 4 


Mean score before class 


83 


78 


93 


87 


Mean score after class 


80 


80 


86 


86 



Table 10.37 



This is: 




a. 


a test of two independent means. 


b. 


a test of two proportions. 


c. 


a test of a single mean. 


d. 


a test of a single proportion. 
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SOLUTIONS 

I two proportions 

3 matched or paired samples 
5 single mean 

7 independent group means, population standard deviations and/or variances unknown 
9 two proportions 

II independent group means, population standard deviations and/or variances unknown 
13 independent group means, population standard deviations and/or variances unknown 
15 two proportions 

17 The random variable is the difference between the mean amounts of sugar in the two soft drinks. 
19 means 
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21 two-tailed 

23 the difference between the mean life spans of whites and nonwhites 

25 This is a comparison of two population means with unknown population standard deviations. 

27 Check student’s solution. 

29 

a. Reject the null hypothesis 

b. p-value < 0.05 

c. There is not enough evidence at the 5% level of significance to support the claim that life expectancy in the 1900s is 
different between whites and nonwhites. 

31 The difference in mean speeds of the fastball pitches of the two pitchers 
33 -2.46 

35 At the 1% significance level, we can reject the null hypothesis. There is sufficient data to conclude that the mean speed 
of Rodriguez’s fastball is faster than Wesley’s. 

37 Subscripts: 1 = Food, 2 = No Food 
Ho: HI < H 2 
H a : hi > H2 

39 



Figure 10.18 




41 Subscripts: 1 = Gamma, 2 = Zeta 
Ho: hi = H2 
H a : pi * H2 

43 0.0062 

45 There is sufficient evidence to reject the null hypothesis. The data support that the melting point for Alloy Zeta is 
different from the melting point of Alloy Gamma. 

47 P'osi - R'oS 2 = difference in the proportions of phones that had system failures within the first eight hours of operation 
with OSi and OS 2 . 

49 0.1018 

51 proportions 

53 right-tailed 

55 The random variable is the difference in proportions (percents) of the populations that are of two or more races in 
Nevada and North Dakota. 
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57 Our sample sizes are much greater than five each, so we use the normal for two proportions distribution for this 
hypothesis test. 

59 Check student’s solution. 

61 

a. Reject the null hypothesis. 

b. p-value < alpha 

c. At the 5% significance level, there is sufficient evidence to conclude that the proportion (percent) of the population 
that is of two or more races in Nevada is statistically higher than that in North Dakota. 

63 the mean difference of the system failures 
65 0.0067 

67 With a p-value 0.0067, we can reject the null hypothesis. There is enough evidence to support that the software patch is 
effective in reducing the number of system failures. 

69 0.0021 

71 



Figure 10.19 




73 Ho: pd > 0 H a : pd < 0 
75 0.0699 

77 We decline to reject the null hypothesis. There is not sufficient evidence to support that the medication is effective. 

79 Subscripts: 1: two-year colleges; 2: four-year colleges 

a. Ho: pi > p2 

b. H a : !H < 112 

c. X j - X 2 is the difference between the mean enrollments of the two-year colleges and the four-year colleges. 

d. Student’s-t 

e. test statistic: -0.2480 

f. p-value: 0.4019 

g. Check student’s solution. 

h. i. Alpha: 0.05 

ii. Decision: Do not reject 

iii. Reason for Decision: p-value > alpha 

iv. Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean enrollment at 
four-year colleges is higher than at two-year colleges. 

81 Subscripts: 1: mechanical engineering; 2: electrical engineering 
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a. Ho: pi > p2 

b. H a : in < p2 

c. X j — X 9 is the difference between the mean entry level salaries of mechanical engineers and electrical engineers. 

d. tl08 

e. test statistic: t = -0.82 

f. p-value: 0.2061 

g. Check student’s solution. 

h. i. Alpha: 0.05 

ii. Decision: Do not reject the null hypothesis. 

iii. Reason for Decision: p-value > alpha 

iv. Conclusion: At the 5% significance level, there is insufficient evidence to conclude that the mean entry-level 
salaries of mechanical engineers is lower than that of electrical engineers. 

83 

a. Ho: in = ]i 2 

b. H a : ]ii * p2 

c. X j — X 2 is the difference between the mean times for completing a lap in races and in practices. 

d. t20.32 

e. test statistic: -4.70 

f. p-value: 0.0001 

g. Check student’s solution. 

h. i. Alpha: 0.05 

ii. Decision: Reject the null hypothesis. 

iii. Reason for Decision: p-value < alpha 

iv. Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean time for 
completing a lap in races is different from that in practices. 

85 

a. Ho: m = P 2 

b. H a : in * Ii2 

c. is the difference between the mean times for completing a lap in races and in practices. 

d. t40.94 

e. test statistic: -5.08 

f. p-value: zero 

g. Check student’s solution. 

h. i. Alpha: 0.05 

ii. Decision: Reject the null hypothesis. 

iii. Reason for Decision: p-value < alpha 

iv. Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean time for 
completing a lap in races is different from that in practices. 



88 c 
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90 Test: two independent sample means, population standard deviations unknown. Random variable: X [ — X 9 

Distribution: Ho : pi = p2 H a : pi < p 2 The mean age of entering prostitution in Canada is lower than the mean age in the 
United States. 



p-value = 0.0151 



0 

Figure 10.20 

Graph: left-tailed p-value : 0.0151 Decision: Do not reject Ho. Conclusion: At the 1% level of significance, from the sample 
data, there is not sufficient evidence to conclude that the mean age of entering prostitution in Canada is lower than the mean 
age in the United States. 

92 d 

94 Subscripts: 1 = boys, 2 = girls 

a. Ho: pi < P2 

b. H a : pi > P2 

c. The random variable is the difference in the mean auto insurance costs for boys and girls. 

d. normal 

e. test statistic: z = 2.50 

f. p-value: 0.0062 

g. Check student’s solution. 

h. i. Alpha: 0.05 

ii. Decision: Reject the null hypothesis. 

iii. Reason for Decision: p-value < alpha 

iv. Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean cost of auto 
insurance for teenage boys is greater than that for girls. 

96 Subscripts: 1 = non-hybrid sedans, 2 = hybrid sedans 

a. Ho: pi > P2 

b. H a : pi < p2 

c. The random variable is the difference in the mean miles per gallon of non-hybrid sedans and hybrid sedans. 

d. normal 

e. test statistic: 6.36 

f. p-value: 0 

g. Check student’s solution. 

h. i. Alpha: 0.05 

ii. Decision: Reject the null hypothesis. 

iii. Reason for decision: p-value < alpha 

iv. Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean miles per gallon 
of non-hybrid sedans is less than that of hybrid sedans. 
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98 

a. Ho: pd = 0 

b. H a : lid < 0 

c. The random variable Xd is the average difference between husband’s and wife’s satisfaction level. 

d. tg 

e. test statistic: t = -1.86 

f. p-value: 0.0479 

g. Check student’s solution 

h. i. Alpha: 0.05 

ii. Decision: Reject the null hypothesis, but run another test. 

iii. Reason for Decision: p-value < alpha 

iv. Conclusion: This is a weak test because alpha and the p-value are close. However, there is insufficient evidence 
to conclude that the mean difference is negative. 

100 

a. Ho:Pw=Pb 

b. Ha-Pw^PB 

c. The random variable is the difference in the proportions of white and black suicide victims, aged 15 to 24. 

d. normal for two proportions 

e. test statistic: -0.1944 

f. p-value: 0.8458 

g. Check student’s solution. 

h. i. Alpha: 0.05 

ii. Decision: Reject the null hypothesis. 

iii. Reason for decision: p-value > alpha 

iv. Conclusion: At the 5% significance level, there is insufficient evidence to conclude that the proportions of white 
and black female suicide victims, aged 15 to 24, are different. 

102 Subscripts: 1 = Cabrillo College, 2 = Lake Tahoe College 

a. Ho: pi = p 2 

b. H a : pi * P2 

c. The random variable is the difference between the proportions of Hispanic students at Cabrillo College and Lake Tahoe 
College. 

d. normal for two proportions 

e. test statistic: 4.29 

f. p-value: 0.00002 

g. Check student’s solution. 

h. i. Alpha: 0.05 

ii. Decision: Reject the null hypothesis. 

iii. Reason for decision: p-value < alpha 

iv. Conclusion: There is sufficient evidence to conclude that the proportions of Hispanic students at Cabrillo College 
and Lake Tahoe College are different. 

104 a 

106 Test: two independent sample proportions. Random variable: p’i - p' 2 Distribution: 

Ho: pi = P 2 
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H a : pi * P2 The proportion of eReader users is different for the 16- to 29-year-old users from that of the 30 and older users. 
Graph: two-tailed 




Figure 10.21 

p-value : 0.0033 Decision: Reject the null hypothesis. Conclusion: At the 5% level of significance, from the sample data, 
there is sufficient evidence to conclude that the proportion of eReader users 16 to 29 years old is different from the 
proportion of eReader users 30 and older. 

108 Test: two independent sample proportions Random variable: p'i - p '2 Distribution: Ho: pi = p2 
H a : pi> P2 A higher proportion of tablet owners are aged 16 to 29 years old than are 30 years old and older. Graph: right- 
tailed 




0.2354 



Figure 10.22 

p-value: 0.2354 Decision: Do not reject the Ho. Conclusion: At the 1% level of significance, from the sample data, there is 
not sufficient evidence to conclude that a higher proportion of tablet owners are aged 16 to 29 years old than are 30 years 
old and older. 

110 Subscripts: 1: men; 2: women 

a. Ho: pi < P 2 

b. H a : pi > P2 

c. P \ ~ P' 2 i s the difference between the proportions of men and women who enjoy shopping for electronic equipment. 

d. normal for two proportions 

e. test statistic: 0.22 

f. p-value: 0.4133 

g. Check student’s solution. 

h. i. Alpha: 0.05 

ii. Decision: Do not reject the null hypothesis. 

iii. Reason for Decision: p-value > alpha 

iv. Conclusion: At the 5% significance level, there is insufficient evidence to conclude that the proportion of men 
who enjoy shopping for electronic equipment is more than the proportion of women. 

112 

a. Ho: pi = p2 
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b. H a \ pi * P2 

c. P' | — P' 2 is the difference between the proportions of men and women that have at least one pierced ear. 

d. normal for two proportions 

e. test statistic: -4.82 

f. p-value: zero 

g. Check student’s solution. 

h. i. Alpha: 0.05 

ii. Decision: Reject the null hypothesis. 

iii. Reason for Decision: p-value < alpha 

iv. Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the proportions of males 
and females with at least one pierced ear is different. 

114 

a. Ho: pd = 0 

b. H a :pd> 0 

c. The random variable Xd is the mean difference in work times on days when eating breakfast and on days when not 
eating breakfast. 

d. tg 

e. test statistic: 4.8963 

f. p-value: 0.0004 

g. Check student’s solution. 

h. i. Alpha: 0.05 

ii. Decision: Reject the null hypothesis. 

iii. Reason for Decision: p-value < alpha 

iv. Conclusion: At the 5% level of significance, there is sufficient evidence to conclude that the mean difference in 
work times on days when eating breakfast and on days when not eating breakfast has increased. 

115 p-value = 0.1494 At the 5% significance level, there is insufficient evidence to conclude that the medication lowered 
cholesterol levels after 12 weeks. 

117 b 

119 c 

121 Test: two matched pairs or paired samples (t-test) Random variable: X d Distribution: ti 2 Ho: pd = 0 H a : pd > 0 The 

mean of the differences of new female breast cancer cases in the south between 2013 and 2012 is greater than zero. The 
estimate for new female breast cancer cases in the south is higher in 2013 than in 2012. Graph: right-tailed p-value: 0.0004 



= 0.0004 



0 




Figure 10.23 
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Decision: Reject Ho Conclusion: At the 5% level of significance, from the sample data, there is sufficient evidence to 
conclude that there was a higher estimate of new female breast cancer cases in 2013 than in 2012. 

123 Test: matched or paired samples (t-test) Difference data: {-0.9, -3.7, -3.2, -0.5, 0.6, -1.9, -0.5, 0.2, 0.6, 0.4, 1.7, -2.4, 

1.8} Random Variable: X d Distribution: Ho : pd = 0 H a : pd < 0 The mean of the differences of the rate of underemployment 

in the northeastern states between 2012 and 2011 is less than zero. The underemployment rate went down from 2011 to 
2012. Graph: left-tailed. 




Figure 10.24 

p-value: 0.1207 Decision: Do not reject Ho . Conclusion: At the 5% level of significance, from the sample data, there is not 
sufficient evidence to conclude that there was a decrease in the underemployment rates of the northeastern states from 2011 
to 2012. 
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